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DETAILED ACTION 
Response to Arguments 

1. Applicant has submitted an amendment filed 1/10/2005 arguing to traverse the 
art rejection. Applicant's arguments have been fully considered but they are not 
persuasive. Regarding previous ground of rejection of claims 5-6, 10-12, and 16-18, 
Gorin et al. (US 6044337) fully anticipates every limitation presented in the forth- 
mentioned claims in that Gorin et al. disclose a speech recognition operable with a 
lexicon database containing a large number of utterances each of which is related to 
one of a predetermined set of routing services (col. 5, lines 30-55 or referring to figures 
3-4, the lexicon database containing speech models trained by prior to actual speech 
recognition). At the time of actual speech recognition, the speech recognizer analyzes 
the input speech to determine what is being requested by comparing the input speech 
with speech models stored in the lexicon database (co/. 5, line 30 to col. 6, line 40, or 
referring to the abstract, this process is well known in any speech recognition art). The 
speech recognizer recognizes and spits out the meaning of the input phrase for further 
processing (col. 6, lines 1-34). Applicant also argues "the keyword list includes words 
that are substantially significant for an application (e.g., "television" in an application for 
automated control home appliances)" (page 13 of the argument/remark section). 
However, Gorin et al. teach a speech recognition system that is able to recognize the 
meaning of input speech. In order to recognize the meaning of the input, the speech 
recognizer must first recognize all the input words that are semantically significant 
before assigning a meaning to the input phrase (example of the phrase "long distance" 
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in col. 4, lines 52-62 is "substantially significant for an application'). That is same as or 
equivalent with spotting keywords. Therefore, previous ground of rejection is 
maintained for claims 5-6, 10-12, and 16-18. 

2. Regarding the remaining claims, applicant agrees that Hedin et al. (US 6185535) 
disclose a client/server speech recognition system in that both the client device and the 
server are equipped with a speech recognizer {page 15 of the argument/remark 
section), but disagrees that Hedin et al. disclose "a system comprising a first speech 
recognition mechanism for recognizing a word sequence, from input speech data, 
based on a language model; and a second speech recognition mechanism for 
recognizing, when the first speech recognition mechanism fails, the word sequence 
based on at least one word detected from the input speech data" {page 15 of the 
argument/remark section). However, Hedin et al. teach all the limitations mentioned 
above in that the when the speech recognizer at the client device fail to recognize the 
input speech, the input speech is forwarded to a second speech recognizer located at 
the server {coL 4, line 66 to col. 5, line 33). Applicant further argues that "the two ASRs 
in Hedin each generate a recognized result which is further acted upon by either the 
client or server" and "Hedin does not disclose anything about the language moder{page 
16 in the argument/remark section). However, nowhere in the claim language of the 
independent claim 1 points out that the recognition result not being acted upon by either 
the client or server. Also, any speech recognition system must include acoustic and 
language models. As thus, previous ground of rejection is maintained. 
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Claim Rejections - 35 USC § 102 

3. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - (b) the invention was patented or described in a printed 
publication in this or a foreign country or in public use or on sale in this country, more than one year prior to 
the date of application for patent in the United States. 

4. Claims 5-6, 10-12, and 16-18 are rejected under 35 U.S.C. 102(b) as being 
anticipated by Gorin et al. (US Patent No. 6044337). 

5. Regarding claims 5, 10, and 16, Gorin et al. disclose a mechanism, method, and 
computer-readable medium encoded with a program for keyword based speech 
recognition, comprising: 

a keyword spotting mechanism for detecting, using at least one acoustic models, 
at least one keyword from input speech data based on a keyword list (col. 5, in. 1 to col. 

6. In. 34 and see figure 4)\ and 

a keyword based recognition mechanism for recognizing a word sequence using 
the at least one keyword, detected by the keyword spotting mechanism, based on a 
language model (col. 5, In. 1 to col. 6, In. 34 and see figure 4). 

6. Regarding claim 6, Gorin et al. further disclose that the system according to claim 
5, further comprising: an index mechanism for establishing indices to word sequences 
that are allowed by the language model based on the keyword list, the indices being 
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used by the keyword based recognition mechanism to recognize the word sequence 
(col. 5, In. 49 to col. 6, In. 34 and see figure 4). 

7. Regarding claims 1 1 and 17, Gorin et al. further disclose that the method and 
medium according to claims 10 and 16, respectively, further comprising: deriving at 
least one word sequence from the language model to generate a language associated 
with the language model {col. 5, In. 43-67)] and establishing indices to the at least one 
word sequence based on each word in the keyword list prior to the detecting (col. 5, In. 
49 to col. 6, In. 34 and see figure 4). 

8. Regarding claims 12 and 18, Gorin et al. further disclose that the method 
according to claims 10 and 17, respectively, wherein the recognizing comprises: 
identifying zero or more candidate word sequences, through the indices between each 
keyword in the keyword list to the at least one word sequence, that contain at least 
some of the at least one keyword (col. 5, In. 30-42); and determining the word sequence 
from the zero or more candidate word sequences that match the at least one keyword 
(col. 6, In. 1-34). 

Claim Rejections - 35 (JSC § 103 

9. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
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invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

10. Claims 1-4, 7-9, and 13-15 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Hedin et al. (US Patent No. 6185535) in view of Setlur et al. (US 
Patent No. 5956675), and further in view of Gorin et al. (US Patent No. 6044337). 

1 1 . Regarding claim 1 , Hedin et al. discloses a system, comprising: 

a first speech recognition mechanism for recognizing a word sequence, from 
input speech data, based on a language model (co/. 5, In. 7-33); and a second speech 
recognition mechanism for recognizing, when the first speech recognition mechanism 
fails, the word sequence based on at least one word detected from the input speech 
data (col. 5, In. 7-33). 

Hedin et al. do not disclose that the first speech recognition mechanism is a 
graph-decoder based speech recognition mechanism, and second speech recognition 
mechanism is a keyword-based speech recognition mechanism. However, Setlur et al. 
teach that the first speech recognition mechanism is a graph-decoder based speech 
recognition mechanism (co/. 4, In. 48 to col. 5, In. 23). 

Since Hedin et al. and Setlur et al. are analogous art because they are from the 
same field of endeavors, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Hedin et al. by incorporating the teaching of Setlur et al. 
in order to improve recognition accuracy while reducing processing time and power. 

The modified Hedin et al. still do not disclose that the second speech recognition 
mechanism is a keyword-based speech recognition mechanism. However, Gorin et al. 
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further teach that the second speech recognition mechanism is a keyword-based 
speech recognition mechanism (col 5, In. 30 to col. 6, In. 34). 

Since the modified Hedin et al. and Gorin et al. are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of ordinary skill 
in the art at the time of invention to further modify Hedin et al. by incorporating the 
teaching of Gorin et al. in order to speech up service request. 

12. Regarding claim 2, Hedin et al. further disclose the system according to claim 1 
that recognition acceptance mechanism for determining whether to accept the 
recognition result generated by the first speech recognition mechanism or to activate, 
when the recognition result from the first speech recognition mechanism is not 
accepted, the keyword based speech recognition mechanism (col. 5, In. 1-33, if not 
recognized, send to other speech recognizer). 

Hedin et al. fail to specifically disclose that a graph decoder based speech 
recognition mechanism comprises: a graph decoder for recognizing the word sequence 
from the input speech data based on at least one acoustic feature to generate a 
recognition result, the recognizing being performed according to at least one acoustic 
model and a language model. However, Setlur et al. further teach a graph decoder for 
recognizing the word sequence from the input speech data based on at least one 
acoustic feature to generate a recognition result, the recognizing being performed 
according to at least one acoustic model and a language model (col. 4, In. 48 to col. 5, 
In. 23). 
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Since Hedin et al. and Setlur et al. are analogous art because they are from the 
same field of endeavors, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Hedin et al. by incorporating the teaching of Setlur et al. 
in order to improve recognition accuracy while reducing processing time and power. 

13. Regarding claim 3, Hedin et al. further disclose a system according to claim 2, 
further comprising an acoustic feature extractor to extract the at least one acoustic 
feature from the input speech data (Feature Vector Extraction 229 in figure 2). 

14. Regarding claim 4, Hedin et al do not disclose the system according to claim 2, 
wherein the keyword based speech recognition mechanism comprises: a keyword 
spotting mechanism, activated by the recognition acceptance mechanism, for detecting, 
using the at least one acoustic models, the at least one keyword from the input speech 
data based on a keyword list; and a keyword based recognition mechanism for 
recognizing the word sequence using the at least one keyword, detected by the 
keyword spotting mechanism, based on the language model. 

However, Gorin et al. further disclose a keyword spotting mechanism, activated 
by the recognition acceptance mechanism, for detecting, using the at least one acoustic 
models, the at least one keyword from the input speech data based on a keyword list 
(col. 5, In. 1 to col. 6, In. 34 and see figure 4)\ and a keyword based recognition 
mechanism for recognizing the word sequence using the at least one keyword, detected 
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by the keyword spotting mechanism, based on the language model (col. 5, In. 1 to col. 
6, In. 34 and see figure 4). 

Since the modified Hedin et al. and Gorin et al. are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of ordinary skill 
in the art at the time of invention to further modify Hedin et al. by incorporating the 
teaching of Gorin et al. in order to speech up service request. 

15. Regarding claims 7 and 13, Hedin et al. disclose a method, comprising: 

a first speech recognition mechanism for recognizing a word sequence, from 
input speech data, based on at least one acoustic features, the recognizing being 
performed using at least one acoustic model and a language model (col. 5, In. 1-33)\ 
determining, by a recognition acceptance mechanism, whether to accept the word 
sequence or to activate a keyword based speech recognition mechanism (col. 5, In. 1- 
33, if not recognized, send to other speech recognizer)] and a second speech 
recognition mechanism for recognizing, when the first speech recognition mechanism 
fails, the word sequence based on at least one word detected from the input speech 
data (col. 5 7 In. 1-33). 

Hedin et al. do not disclose that the first speech recognition mechanism is a 
graph-decoder based speech recognition mechanism, and second speech recognition 
mechanism is a keyword-based speech recognition mechanism. However, Setlur et al. 
teach that the first speech recognition mechanism is a graph-decoder based speech 
recognition mechanism (col. 4, In. 48 to col. 5, In. 23). 
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Since Hedin et al. and Setlur et al. are analogous art because they are from the 
same field of endeavors, it would have been obvious to one of ordinary skill in the art at 
the time of invention to modify Hedin et al. by incorporating the teaching of Setlur et al. 
in order to improve recognition accuracy while reducing processing time and power. 

The modified Hedin et al. still do not disclose that the second speech recognition 
mechanism is a keyword-based speech recognition mechanism. However, Gorin et al. 
further teach that the second speech recognition mechanism is a keyword-based 
speech recognition mechanism (co/. 5, In. 30 to col. 6, In. 34). 

Since the modified Hedin et al. and Gorin et al. are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of ordinary skill 
in the art at the time of invention to further modify Hedin et al. by incorporating the 
teaching of Gorin et al. in order to speed up service request. 

16. Regarding claims 8 and 14, the modified Hedin et al. do not disclose the method 
and medium according to claims 7 and 13, respectively, wherein the performing 
comprises: detecting, by a keyword spotting mechanism, at least one keyword, 
according to a keyword list, from the input speech data based on the at least one 
acoustic model; and recognizing, by a keyword based recognition mechanism, the word 
sequence using the at least one keyword, detected by the detecting, based on the 
language model. 

However, Gorin et al. further teach detecting, by a keyword spotting mechanism, 
at least one keyword, according to a keyword list, from the input speech data based on 
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the at least one acoustic model (col. 5, In. 1 to col. 6, In. 34 and see figure 4)\ and 
recognizing, by a keyword based recognition mechanism, the word sequence using the 
at least one keyword, detected by the detecting, based on the language model {col. 5, 
In. 1 to col. 6, In. 34 and see figure 4). 

Since the modified Hedin et al. and Gorin et al. are analogous art because they 
are from the same field of endeavors, it would have been obvious to one of ordinary skill 
in the art at the time of invention to further modify Hedin et al. by incorporating the 
teaching of Gorin et al. in order to speed up service request by eliminating traditional 
long-waited menu-based service selection. 

17. Regarding claims 9 and 15, Hedin et al. further disclose the method and medium 
according to claims 7 and 13, respectively, further comprising: receiving the input 
speech data (microphone 215 in figure 2); and extracting, by an acoustic feature 
extractor, the at least one acoustic feature from the input speech data (Feature Vector 
Extraction 229 in figure 2). 

Conclusion 

THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1 . 1 36(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
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mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Huyen X Vo whose telephone number is 571-272-7631 . 
The examiner can normally be reached on M-F, 9-5:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Wayne Young can be reached on 571-272-7582. The fax phone number for 
the organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 

HXV 6/1/2005 
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